fix(api-v1): return real task.status in CreateTaskResponse, not hardcoded "pending" by AlexLiu190625 · Pull Request #426 · xorbitsai/xagent

AlexLiu190625 · 2026-05-18T11:19:49Z

Summary

Two related response-correctness fixes for the v1 SDK API surface — both make the task row reflect the real outcome instead of a hardcoded placeholder.

Fix 1 — `status` field on `POST /v1/chat/tasks`

POST /v1/chat/tasks returned status="pending" in the response body even though begin_turn had already atomically claimed the row as RUNNING inside the same handler before the response was sent. An SDK client doing POST followed by an immediate GET on the same task would see two contradictory status values from back-to-back calls (pending then running).

The handler now reads task.status.value after begin_turn returns. begin_turn refreshes the in-memory task after committing the atomic claim, so the post-handler view reflects the row's real state. AppendMessageResponse already returned the post-claim status; the create path now matches it, and both endpoints read the status from the refreshed row so they share the same expression form.
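To make the contract concrete, here is a minimal sketch of the create path. The `TaskStatus` enum, the `Task` dataclass, and this stand-in `begin_turn` are simplified assumptions for illustration, not the project's real models or signatures; only the idea of reading `task.status.value` after the claim comes from the description above.

```python
from dataclasses import dataclass
from enum import Enum


class TaskStatus(Enum):
    PENDING = "pending"
    RUNNING = "running"


@dataclass
class Task:
    id: int
    status: TaskStatus = TaskStatus.PENDING


def begin_turn(task: Task) -> None:
    """Hypothetical stand-in for the atomic claim.

    The real helper commits the claim and refreshes the in-memory row;
    here that is modelled by flipping the status directly.
    """
    task.status = TaskStatus.RUNNING


def create_task_response(task: Task) -> dict:
    begin_turn(task)
    # Read the status from the refreshed row instead of hardcoding "pending",
    # so an immediate GET on the same task agrees with this response.
    return {"task_id": task.id, "status": task.status.value}
```

Because the response is derived from the row, any later change to what `begin_turn` sets would show up in the payload without touching the handler.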

Fix 2 — `task.error_message` populated with real exception text

execute_task_background previously logged and broadcast the exception text on failure but never wrote it to task.error_message. The row's status stayed RUNNING, and finish_turn's RUNNING-fallback branch then unconditionally set error_message to a generic placeholder ("Task execution failed without status update; see /steps."), forcing SDK and web clients to fetch /steps to discover what actually went wrong.

The fix writes the real exception text to task.error_message and flips status to FAILED in the exception handler, using a fresh session because the original may be in a failed-transaction state. finish_turn's FAILED branch then only fills in a placeholder when error_message is empty, so the real message is preserved through to the final row state.
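The interaction between the exception handler and finish_turn can be sketched as follows. This is an in-memory illustration under stated assumptions: `Task`, `TaskStatus`, `record_failure`, and this simplified `finish_turn` are hypothetical stand-ins, and the fresh database session is left out entirely. Only the placeholder text and the branch behaviour are taken from the description above.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional

PLACEHOLDER = "Task execution failed without status update; see /steps."


class TaskStatus(Enum):
    RUNNING = "running"
    COMPLETED = "completed"
    FAILED = "failed"


@dataclass
class Task:
    status: TaskStatus = TaskStatus.RUNNING
    error_message: Optional[str] = None


def record_failure(task: Task, exc: BaseException) -> None:
    """What the exception handler now does: store the real text, flip to FAILED."""
    task.status = TaskStatus.FAILED
    task.error_message = str(exc)


def finish_turn(task: Task) -> None:
    """Simplified reconciliation of the terminal fields."""
    if task.status is TaskStatus.RUNNING:
        # RUNNING fallback: nobody recorded an outcome, so use the placeholder.
        task.status = TaskStatus.FAILED
        task.error_message = PLACEHOLDER
    elif task.status is TaskStatus.FAILED and not task.error_message:
        # FAILED branch: fill the placeholder only when no message exists.
        task.error_message = PLACEHOLDER
```

With the handler recording the failure first, finish_turn takes the FAILED branch and leaves the real message in place. Without it, the row falls through to the RUNNING fallback and gets the generic text.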

Changes

src/xagent/web/api/v1/tasks.py:
- POST /v1/chat/tasks reads task.status.value instead of hardcoded "pending"; brief comment mirrors the equivalent block at the AppendMessageResponse return site.
- POST /v1/chat/tasks/{id}/messages switched to the same task.status.value pattern so the two endpoints are symmetric (was hardcoded "running").
- create_chat_task Returns: docstring updated to describe the post-claim status='running' contract.
src/xagent/web/schemas/v1.py: CreateTaskResponse and AppendMessageResponse field docstrings updated — both previously claimed "always 'pending'", both now describe the post-claim running semantics.
src/xagent/web/api/websocket.py: execute_task_background exception handler now writes str(e)[:4000] to task.error_message and flips status to FAILED in a fresh session before broadcasting the error event.
tests/web/api/v1/test_tasks.py: test_create_task_happy_path was asserting the old "pending" value and has been updated to assert "running", with a comment explaining the post-claim semantics.
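The `str(e)[:4000]` expression in the websocket.py change caps the stored text at 4000 characters. A minimal sketch of that truncation in isolation, where `error_text` and `MAX_ERROR_LEN` are hypothetical names introduced here and only the 4000-character cap comes from the change list:

```python
MAX_ERROR_LEN = 4000  # cap taken from the change description


def error_text(exc: BaseException, limit: int = MAX_ERROR_LEN) -> str:
    """Return the exception text, truncated so it fits the error_message column."""
    return str(exc)[:limit]
```

Slicing a string shorter than the limit returns it unchanged, so short messages pass through untouched.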

Why not a regression risk

No behavior change in orchestrator, scheduler, or DB transaction edges.
Only the persisted-row content changes for the status and error_message columns; both are now accurate where they were previously a placeholder.
AppendMessageResponse already returned "running" for the same reason, so SDK clients tolerating that one will tolerate the create-side identically.
Fix 2 uses a fresh session, so an in-flight failed transaction on the original session can't block the error persist.

Test plan

pytest tests/web/api/v1/ tests/web/services/ tests/web/api/test_agent_api_keys.py — 92 passed
pre-commit run --files <touched files> — ruff / mypy / isort / codespell green
Manual review confirms finish_turn's FAILED branch preserves the populated error_message (only writes a placeholder when empty).

… "pending"

POST /v1/chat/tasks returned ``status="pending"`` in the response body even though ``begin_turn`` had already atomically claimed the row as RUNNING inside the same handler before the response was sent. An SDK client doing POST followed by an immediate GET on the same task would see two contradictory status values from back-to-back calls (``pending`` then ``running``).

Read ``task.status.value`` after ``begin_turn`` returns instead -- ``begin_turn`` refreshes the in-memory ``task`` after committing the atomic claim, so the post-handler view reflects the row's real state. ``AppendMessageResponse`` already followed this contract; the create path now matches it.

Schema docstrings for CreateTaskResponse and AppendMessageResponse updated to describe the post-claim ``running`` semantics; the ``test_create_task_happy_path`` assertion was checking for the old ``pending`` value and has been updated.

No behavior change in the orchestrator or scheduler -- this is a response-payload correctness fix only. Twenty-eight v1 task tests pass; pre-commit (ruff / mypy / isort / codespell) green.

gemini-code-assist

Code Review

This pull request updates the task creation API to return the actual task status (typically 'running') instead of a hardcoded 'pending' value, ensuring consistency between the response and the database state. Documentation and tests have been updated to reflect this change. The reviewer suggested further improving consistency by replacing hardcoded status strings with the Enum value in other response models like AppendMessageResponse.

qinxuye · 2026-05-18T11:38:29Z

The `v1` scope in the title is distracting; maybe call it `api-v1` or something similar.

…nse status read

Follow-up to the CreateTaskResponse status fix in the previous commit. Two consistency cleanups within the same handler module:

1. ``create_chat_task`` function docstring still described the return value as ``status='pending'`` -- inconsistent with the schema docstring and the implementation that the previous commit already moved to ``task.status.value`` (i.e. 'running'). Updated the Returns section to match.
2. ``append_message_to_task`` returned ``status="running"`` hardcoded while ``create_chat_task`` reads ``task.status.value``. Unified both endpoints on the same pattern -- reading from the refreshed in-memory row is defensive (any future ``begin_turn`` status-machine change is picked up automatically) and removes the asymmetry where one endpoint reflected the DB and the other asserted a fixed string.

No behavior change for the current contract -- the previous commit already returned 'running' from CREATE; this commit makes APPEND share the same expression form and fixes the docstring drift. Twenty-eight v1 task tests pass; pre-commit green.

AlexLiu190625 · 2026-05-18T12:16:15Z

Renamed the scope from v1 to api-v1 to point at the API surface directly — agreed it was ambiguous. Commit messages in-branch still say fix(v1): since the PR is squash-merged and the final commit picks up the new title.

…lure

``execute_task_background`` previously logged and broadcast the exception text on failure but never wrote it to ``task.error_message``. The row's status stayed RUNNING, and ``finish_turn``'s RUNNING-fallback branch then unconditionally set ``error_message`` to a generic placeholder ("Task execution failed without status update; see /steps."), forcing SDK and web clients to fetch ``/steps`` to discover what actually went wrong.

The fix writes the real exception text to ``task.error_message`` and flips ``status`` to ``FAILED`` in the exception handler, using a fresh session because the original may be in a failed-transaction state. ``finish_turn``'s FAILED branch then only fills in a placeholder when ``error_message`` is empty, so the real message is preserved through to the final row state.

Same family of fix as the ``status="pending"`` correction above: make the persisted task row reflect the real outcome rather than a generic placeholder. Affects both SDK consumers (GET /v1/chat/tasks/{id}) and web/WebSocket consumers reading the same row. Twenty-eight v1 task tests pass; pre-commit (ruff / mypy / isort / codespell) green.

rogercloud

I found one issue in the failure handling path.

…site

execute_task_background's outer except spans post-terminal steps (assistant-message persistence and the completion/paused broadcasts) that run after the task status was already committed COMPLETED. The recently added FAILED-persistence wrote the row unconditionally, so a failure in one of those best-effort steps -- e.g. the completion broadcast losing its websocket -- rewrote an already-completed task as FAILED and stored the broadcast error in error_message.

Branch the handler on the task's current status instead. Only a task still RUNNING is a genuine execution failure: record the real exception text, flip to FAILED, and emit task_error. A task already in a terminal state tripped here in a best-effort post-completion step, so observe it without touching the row or emitting a contradictory task_error; finish_turn still reconciles the terminal fields afterward.

Single content conflict in websocket.py's execute_task_background failure handler. Adopt upstream's _terminal_task_error_payload (which persists FAILED + the real error_message and builds the notification payload) as the genuine-failure path, but gate it on the task still being RUNNING. The outer except also spans post-completion best-effort steps (assistant-message persistence, completion/paused broadcasts); a failure there must not rewrite an already-terminal task as FAILED or emit a contradictory task_error -- it is logged and the row is left for finish_turn to reconcile.
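The status-gated handler described in these two commits can be sketched roughly as below. Everything here is a simplified assumption: `handle_background_failure`, the `Task` dataclass, and the `events` list standing in for the broadcast channel are hypothetical, and the real code goes through `_terminal_task_error_payload` and a database session rather than mutating an object in memory.

```python
import logging
from dataclasses import dataclass
from enum import Enum
from typing import Optional

logger = logging.getLogger(__name__)


class TaskStatus(Enum):
    RUNNING = "running"
    COMPLETED = "completed"
    FAILED = "failed"


@dataclass
class Task:
    status: TaskStatus
    error_message: Optional[str] = None


def handle_background_failure(task: Task, exc: BaseException, events: list) -> bool:
    """Return True when the exception was treated as a genuine task failure."""
    if task.status is TaskStatus.RUNNING:
        # Genuine execution failure: persist the real text and notify clients.
        task.status = TaskStatus.FAILED
        task.error_message = str(exc)
        events.append({"type": "task_error", "error": task.error_message})
        return True
    # Already terminal: a best-effort post-completion step failed.
    # Log it, leave the row alone, and emit no contradictory task_error.
    logger.warning("post-completion step failed for %s task: %s", task.status.value, exc)
    return False
```

The gate is on the row's current status rather than on where the exception was raised, which keeps the handler correct even as more steps are added inside the outer try block.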

rogercloud

Follow-up on the narrowed failure boundary.

The success path committed COMPLETED/FAILED before persisting the assistant message, which is a separate durable write. If that write failed, the row was left COMPLETED with no message and no error_message -- the status-gated failure handler treated it as a best-effort post-completion step and left it untouched. Leave the terminal status pending and let persist_assistant_message's commit land it atomically with the message. A failure in that durable write now leaves the status RUNNING, so the outer handler surfaces a real task failure instead of a contradictory empty COMPLETED row. Only notification broadcasts remain best-effort.

The previous change rode the terminal-status commit on persist_assistant_message's internal commit. But that helper early-returns without committing when the assistant content is empty (a valid empty-reply turn). The terminal status was then never committed, the row stayed RUNNING, and finish_turn flipped a successful empty turn to FAILED. Add an explicit commit after persistence. It lands the terminal status whether or not a message row was written, while still surfacing a real failure when persistence raises (control never reaches the explicit commit, so the status stays uncommitted and the outer except fails it).
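The commit ordering can be illustrated with a toy session that separates pending from committed state. `FakeSession`, `complete_turn`, the `fail` flag, and this simplified `persist_assistant_message` are hypothetical stand-ins written for this sketch; the real helper and session are not shown in this thread.

```python
class FakeSession:
    """Toy session: a status change is only durable after commit()."""

    def __init__(self) -> None:
        self.committed_status = "running"
        self.pending_status = None
        self.messages = []

    def commit(self) -> None:
        if self.pending_status is not None:
            self.committed_status = self.pending_status
            self.pending_status = None


def persist_assistant_message(session: FakeSession, content: str, fail: bool = False) -> None:
    if fail:
        raise RuntimeError("message write failed")
    if not content:
        # Empty reply: nothing to write, so the helper returns without committing.
        return
    session.messages.append(content)
    session.commit()


def complete_turn(session: FakeSession, content: str, fail: bool = False) -> None:
    # Stage the terminal status but do not commit it yet.
    session.pending_status = "completed"
    persist_assistant_message(session, content, fail=fail)
    # Explicit commit: lands the status even when no message row was written.
    # If persistence raised, control never reaches this line.
    session.commit()
```

An empty reply ends up committed as completed, while a failed message write leaves the committed status at running for the outer handler to fail.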

rogercloud

Reviewed the current head c0aa960. No findings; targeted local tests and GitHub checks are green.

XprobeBot added the bug (Something isn't working) label May 18, 2026

gemini-code-assist Bot reviewed May 18, 2026

Comment thread src/xagent/web/api/v1/tasks.py

AlexLiu190625 changed the title ~~fix(v1): return real task.status in CreateTaskResponse, not hardcoded "pending"~~ fix(api-v1): return real task.status in CreateTaskResponse, not hardcoded "pending" May 18, 2026

AlexLiu190625 mentioned this pull request May 18, 2026: Worker event loop blocked 25–30s during agent task initialization (concurrent capacity cliff) #427 (Open)

qinxuye requested a review from rogercloud May 22, 2026 07:56

rogercloud reviewed Jun 1, 2026

Comment thread src/xagent/web/api/websocket.py Outdated

AlexLiu190625 added 2 commits June 1, 2026 11:25

AlexLiu190625 requested a review from rogercloud June 1, 2026 04:10

rogercloud reviewed Jun 1, 2026

Comment thread src/xagent/web/api/websocket.py

AlexLiu190625 added 2 commits June 1, 2026 16:36

AlexLiu190625 requested a review from rogercloud June 1, 2026 09:05

rogercloud approved these changes Jun 2, 2026

qinxuye merged commit fc738ec into xorbitsai:main Jun 2, 2026
4 checks passed

qinxuye merged 7 commits into xorbitsai:main from AlexLiu190625:fix/v1-create-task-response-status-mismatch


4 participants
